# Lightweight Vision Model

Devstral Small Vision 2505 GGUF
Apache-2.0
Vision encoder based on Mistral Small model, supports image-text generation tasks, compatible with llama.cpp framework
Image-to-Text
D
ngxson
777
20
Lsnet B
MIT
LSNet is a family of lightweight vision models inspired by the dynamic multi-scale capabilities of the human visual system, achieving a balance between performance and efficiency across various vision tasks.
Image Classification
L
jameslahm
186
1
Sam2 Hiera Small.fb R896 2pt1
Apache-2.0
SAM2 weights (HieraDet image encoder only) based on the timm library, derived from Facebook's Hiera small model.
Image Segmentation Transformers
S
timm
67
0
Cat Emotion Classifier
Apache-2.0
Fine-tuned version of Google's ViT model for cat emotion classification
Image Classification Transformers
C
semihdervis
54
2
Autotrain Test 41086106044
A multi-class image classification model trained using AutoTrain, capable of classifying common objects
Image Classification Transformers
A
Younesao
16
0
Swin Tiny Patch4 Window7 224 Finetuned Eurosat
Apache-2.0
An image classification model based on the Swin Transformer Tiny architecture, fine-tuned on the CIFAR10 dataset with an accuracy of 97.24%
Image Classification Transformers
S
eric1993
16
0
Autotrain Ex And Pt 3122688390
A multi-class image classification model trained using AutoTrain, capable of recognizing and classifying various common objects
Image Classification Transformers
A
Lloviant
17
0
Autotrain Ex And Pt 3122688386
This is a multi-class image classification model trained using AutoTrain, capable of recognizing common objects such as tigers, teapots, and palaces.
Image Classification Transformers
A
Lloviant
17
0
Swin Tiny Patch4 Window7 224 Finetuned Eurosat
Apache-2.0
Image classification model fine-tuned on the CIFAR10 dataset based on Swin Transformer Tiny architecture
Image Classification Transformers
S
gneuert
18
0
Vit Base Patch16 224 Finetuned
Apache-2.0
An image classification model fine-tuned based on Google's Vision Transformer (ViT), trained on custom image datasets
Image Classification Transformers
V
clp
30
0
Swin Tiny Patch4 Window7 224 Finetuned Woody LeftGR 130epochs
Apache-2.0
Image classification model based on Swin Transformer Tiny architecture, fine-tuned for 130 epochs on a specific image dataset
Image Classification Transformers
S
Alex-VisTas
12
0
Levit 192 Finetuned On Unlabelled IA With Snorkel Labels
Apache-2.0
This model is a fine-tuned version of facebook/levit-192 on an unlabeled dataset, demonstrating excellent performance in precision, recall, F1 score, and accuracy.
Image Classification Transformers
L
ImageIN
19
0
Swin Tiny Finetuned Dogfood
Apache-2.0
A dog food image classification model fine-tuned based on Swin Transformer Tiny architecture, achieving 98.8% accuracy on the test set
Image Classification Transformers
S
sasha
15
1
Snacks Classifier
A lightweight image classification model based on Microsoft's Swin Transformer Tiny architecture, achieving 92.86% test accuracy after fine-tuning on a snack classification dataset
Image Classification Transformers
S
Matthijs
15
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase